Final Project - Sales Assessment

Group 6

Replacing the NaN values with 0, Deleting the last column as it is totally blank

From above 2 box plot we found that STAR dealer have made a maximum revenue comapare to non-STAR dealer and there are some store which have made a quite high revenue which can show here as an outliers.

Boxplots of customer size A and E tells that Stores have made more revenue with large amount of customer size

There are several stores that have under 1000 customers.

There are some products qty which have highest percent of stores.

Large amount of store have made a revenue under 0.5 millions in the year 2021.

Large amount of store have made a revenue under 1 millions in the year 2020.

Top performing stores

Top 10 stores as per their total revenue. It'd be interesting to analyze their revenue for 2020 and 2021 along with the customer base.

By comparing the revenue of 2020 and 2021 of top 10 stores, found that almost revenue is increased in the year 2021 compare to 2020.

Mostly the stores have the large number of customers which have made the highest revenue except 2 store (56050560 and 596615966) have the small number of customers.

Found that Store id (56050560) dealer has attended the star program based on that we can say that star program is somewhat benifical to improve the revenue.

Top 10 stores by products' quantity

Finding the mean of PI based on customer size

Annual Revenue based on customer size

As the number of customers increased to come in the stores annual revenue also increase.

We have a more stores which have the small number of customers(B Size).

Annual revenue categorized by region

California has made the maximum revenue and Denver and Midwest have made the least revenue.

Checking the annual revenue distribution of California's stores based on years

Mostly revenue is increased in the year 2021 in California.

Checking the annual revenue distribution of Denver's stores based on years

Correlation of variables with Target Variable (Total Revenue)

Stores' performance as per Market Type

Analyzing average products sold based on market type

Taking total sum of each attribute to know how much they have help in driving the revenue

From the above plot we could see " CPOV Base Warranties" , "New Majors Stated Time", " Used Majors Stated Time" brings the most revenue.

Distributing top 10 stores based on New Majors Stated Time

Taking top 10 with highest revenue and checking which store gave the maximum sales

Top 10 Stores based on Annual Revenue

From above plot we could see store ID 736057360 have the highest grand total which means have the highest sales and gave the maximum profit, so we can consider this as a baseline store to compare other stores and find the defects so that it can be used to increase the revenue of other stores

Bottom 10 Stores based on annual Revenue

From the above plot we can see store id 882688826 and 264542645 are performing worst and we need to foucus on it.

Correlation

After removing strongly correlated variables like 2021 Revenue, 2020 Revenue, 2021 PEN, 2021 Products (Qty) , we got nearly 94% of accuracy which is quite good.